Determining Genetic Causal Variants Through Multivariate Regression Using Mixture Model Penalty

نویسندگان

  • V. S. Sundar
  • Chun-Chieh Fan
  • Dominic Holland
  • Anders M. Dale
چکیده

With the availability of high-throughput sequencing data, identification of genetic causal variants accurately requires the efficient incorporation of function annotation data into the optimization routine. This motivates the need for development of novel methods for genome wide association studies with special focus on fine-mapping capabilities. A penalty function method that is simple to implement and capable of integrating functional annotation information into the estimation procedure, is proposed in this work. The idea is to use the prior distribution of the effect sizes explicitly as a penalty function. The estimates obtained are shown to be better correlated with the true effect sizes (in comparison with a few existing techniques). An increase in the positive and negative predictive value is demonstrated using Hapgen2 simulated data.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Model selection based on logistic regression in a highly correlated candidate gene region

Our aim is to develop methods for identifying a (causal) variant or variants from a dense panel of single-nucleotide polymorphisms (SNPs) that are genotyped on the evidence of previous studies. Because a large number of SNPs are in close proximity to each other, the magnitude of linkage disequilibrium (LD) plays an important role. Namely, highly correlated SNPs may hamper standard methods such ...

متن کامل

Determining Effective Factors on Forest Fire Using the Compound of Multivariate Adaptive Regression Spline and Genetic Algorithm, a Case Study: Golestan, Iran

Determining Effective Factors on Forest Fire Using the Compound of Multivariate Adaptive Regression Spline and Genetic Algorithm, a Case Study: Golestan, Iran    Pahlavani, P., Assistant professor at School of Surveying and Geospatial Engineering, College of Engineering, University of Tehran Raei, A., PhD Candidate of GIS at School of Surveying and Geospatial Engineering, College of Engineeri...

متن کامل

Determining Effective Factors on Forest Fire Using the Compound of Multivariate Adaptive Regression Spline and Genetic Algorithm, a Case Study: Golestan, Iran

Determining Effective Factors on Forest Fire Using the Compound of Multivariate Adaptive Regression Spline and Genetic Algorithm, a Case Study: Golestan, Iran    Pahlavani, P., Assistant professor at School of Surveying and Geospatial Engineering, College of Engineering, University of Tehran Raei, A., PhD Candidate of GIS at School of Surveying and Geospatial Engineering, College of Engineeri...

متن کامل

Penalized-regression-based multimarker genotype analysis of Genetic Analysis Workshop 17 data

Testing for association between multiple markers and a phenotype can not only capture untyped causal variants in weak linkage disequilibrium with nearby typed markers but also identify the effect of a combination of markers. We propose a sliding window approach that uses multimarker genotypes as variables in a penalized regression. We investigate a penalty with three separate components: (1) a ...

متن کامل

Penalized regression approaches to testing for quantitative trait-rare variant association

In statistical data analysis, penalized regression is considered an attractive approach for its ability of simultaneous variable selection and parameter estimation. Although penalized regression methods have shown many advantages in variable selection and outcome prediction over other approaches for high-dimensional data, there is a relative paucity of the literature on their applications to hy...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 9  شماره 

صفحات  -

تاریخ انتشار 2018